Computing the Hazard Ratios Associated With Explanatory Variables Using Machine Learning Models of Survival Data

نویسندگان

چکیده

PURPOSE The application of Cox proportional hazards (CoxPH) models to survival data and the derivation hazard ratio (HR) are well established. Although nonlinear, tree-based machine learning (ML) have been developed applied analysis, no methodology exists for computing HRs associated with explanatory variables from such models. We describe a novel way compute ML using SHapley Additive exPlanation values, which is locally accurate consistent quantify variables’ contribution predictions. METHODS used three sets publicly available consisting patients colon, breast, or pan cancer compared performance CoxPH state-of-the-art model, XGBoost. To HR XGBoost values were exponentiated means over two subgroups was calculated. CI computed via bootstrapping training generating model 1,000 times. Across sets, we systematically all variables. Open-source libraries in Python R analyses. RESULTS For colon breast comparable, showed good consistency HRs. In pan-cancer set, agreement most but also an opposite finding between result. Subsequent Kaplan-Meier plots supported model. CONCLUSION Enabling can help improve identification risk factors complex enhance prediction clinical trial outcomes.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Machine Learning Models for Housing Prices Forecasting using Registration Data

This article has been compiled to identify the best model of housing price forecasting using machine learning methods with maximum accuracy and minimum error. Five important machine learning algorithms are used to predict housing prices, including Nearest Neighbor Regression Algorithm (KNNR), Support Vector Regression Algorithm (SVR), Random Forest Regression Algorithm (RFR), Extreme Gradient B...

متن کامل

zoning of flood hazard in Nowshahr city using machine learning models

  The aim of this study is to predict and model flood hazard in the city of Nowshahr, Mazandaran province using machine learning models. The criteria and indicators affecting flood hazard were identified based on the review of resources, and then the indicators were converted into rasters in ArcGIS environment, and finally standardized by fuzzy method for use in the models. K-nearest neighbor ...

متن کامل

Genomic computing. Explanatory analysis of plant expression profiling data using machine learning.

As with every other organism whose genome has been sequenced (Hinton, 1997; Bork et al., 1998), a chief finding in plants (Bevan et al., 1999; Somerville and Somerville, 1999) is the presence of a vast number of genes (many with no relatives in the databases) whose existence, let alone function, had previously gone unrecorded. The importance of finding the function of these genes has led to wha...

متن کامل

the relationship between using language learning strategies, learners’ optimism, educational status, duration of learning and demotivation

with the growth of more humanistic approaches towards teaching foreign languages, more emphasis has been put on learners’ feelings, emotions and individual differences. one of the issues in teaching and learning english as a foreign language is demotivation. the purpose of this study was to investigate the relationship between the components of language learning strategies, optimism, duration o...

15 صفحه اول

Dust source mapping using satellite imagery and machine learning models

Predicting dust sources area and determining the affecting factors is necessary in order to prioritize management and practice deal with desertification due to wind erosion in arid areas. Therefore, this study aimed to evaluate the application of three machine learning models (including generalized linear model, artificial neural network, random forest) to predict the vulnerability of dust cent...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: JCO clinical cancer informatics

سال: 2021

ISSN: ['2473-4276']

DOI: https://doi.org/10.1200/cci.20.00172